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Abstract. In the late seventies, Clark [6] pointed out that it would be natural for 7Tt, the solution 
of the stochastic filtering problem, to depend continuously on the observed data Y = {Y s ,s 6 [0, t]}. 
Indeed, if the signal and the observation noise are independent one can show that, for any suitably 
chosen test function /, there exists a continuous map d{ , defined on the space of continuous paths 
C([0, t], M. d ) endowed with the uniform convergence topology such that vrt(/) = 9l(Y), almost surely 
(see, eg. [6, 7, 10, 11, 18]). As shown by Davis [12, 13, 14], this type of robust representation is also 
possible when the signal and the observation noise are correlated, provided the observation process is 
scalar. For a general correlated noise and multidimensional observations such a representation does not 
exist. By using the theory of rough paths we provide a solution to this deficiency: The observation 
process Y is "lifted" to the process Y that consists of Y and its corresponding Levy area process and 
we show that there exists a continuous map 8^ , defined on a suitably chosen space of Holder continuous 
paths such that nt{f) = 00, almost surely. 



1. Introduction 

Let (f2, J-, (J-t)t>o-, P) be a filtered probability space on which we have defined a two component diffusion 
process (X, Y) solving a stochastic differential equation driven by a multidimensional Brownian motion. 
One assumes that the first component X is unobservable and the second component Y is observed. The 
filtering problem consists in computing the conditional distribution of the unobserved component, called 
the signal process, given the observation process Y. Equivalently, one is interested in computing 

n t (f)=E[f(X t ,Y t )\y t ], 

where y = {yt,t > 0} is the observation filtration and / is a suitably chosen test function. An elementary 
measure theoretic result tells us 1 that there exists a Borel-measurable map 0{ : C([0,£],R dv ) — > R, such 
that 

(1) 7r t (p) = 6{(Y) P-a.s., 

where dy is the dimension of the observation state space and Y. is the path- valued random variable 

Y. : Q O([0,t],M. dY ), Y.{uj) = (Y s (u), < s < t). 
Of course, &{ is not unique. Any other function 6 t such that 

Poy- 1 (o{^6{) =o, 



where P o Yr 1 is the distribution of Y. on the path space C([0, t], M. dY ) can replace 0{ in (1). It would 
be desirable to solve this ambiguity by choosing a suitable representative from the class of functions that 
satisfy (1). A continuous version, if it exists, would enjoy the following uniqueness property: if the law 
of the observation Po Yr 1 positively charges all non-empty open sets in C([0,t),M. dY ) then there exists a 
unique continuous function Q{ that satisfies (1). In this case, we call 0{(Y.) the robust version of irt((f) 
and equation (1) is the robust representation formula for the solution of the stochastic filtering problem. 



Key words and phrases, filtering, robustness, rough path theory. 
^See, for example, Proposition 4.9 page 69 in [4]. 
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The need for this type of representation arises when the filtering framework is used to model and solve 
'real-life' problems. As explained in a substantial number of papers (e.g. [0, 7, 10, 11, 12, 13, 14, 18]) 
the model chosen for the "real-life" observation process Y may not be a perfect one. However, as long as 
the distribution of Y. is close in a weak sense to that of Y. (and some integrability assumptions hold) , the 
estimate 0{(Y.) computed on the actual observation will still be reasonable, as E[(/(X(, Yt) — 9{(Y.)) 2 } is 
well approximated by the idealized error E,[(f(X t ,Y t ) — 9{(Y.)) 2 ]. 

Moreover, even when Y and Y actually coincide, one is never able to obtain and exploit a continuous 
stream of data as modelled by the continuous path Y.{u). Instead the observation arrives and is processed 
at discrete moments in time 

= t < tx < t 2 < ■ ■ ■ < t n = t. 

However the continuous path Y.(ui) obtained from the discrete observations (Y ti (w))™ =1 by linear interpola- 
tion is close to Y.(uj) (with respect to the supremum norm on C([0, t], K dv ); hence, by the same argument, 
0{ (Y.) will be a sensible approximation to 71* (<^). 

In the following, we will assume that the pair of processes (X, Y) satisfy the equation 

(2) dX t = l (X u Y t )dt + Y,Zk(X t ,Y t )dW t k +Y,L j {XuY t )dBi 

k j 

(3) dY t = h(X u Y t )dt + dW t , 

with Xo being a bounded random variable and Yq = 0. In (2) and (3), the process X is the dx~ 
dimensional signal, Y is the dy-dimcnsional observation, B and W are independent ds-dimcnsional, re- 
spectively, dy-dimcnsional Brownian motions independent of X$. Suitable assumptions on the coefficients 
l , L u ..., L dB : R d *+ d Y — > R d * , Z x , . . . , Z dY : E. dx+dY -> R dx and h = {h\ . . . , h dv ) : R d ^+ d ^ -> W 1y 
will be introduced later on. This framework covers a wide variety of applications of stochastic filtering 
(see, for example, [9] and the references therein) and has the added advantage that, within it, ir t (tp) admits 
an alternative representation that is crucial for the construction of its robust version. Let us detail this 
representation first. 

Let u = {ut , t > 0} be the process defined by 

(4) u t =e^(-^j\\X s ,Y s )dWi-^j\h i (X s ,Y s )) 2 ds^j . 

Then, under suitable assumptions 2 , u is a martingale which is used to construct the probability measure 
Po equivalent to P on Uo<t<oo J~t whose Radon-Nikodym derivative with respect to P is given by u, viz 



dPo 
dP 



= u t . 



Under Po, Y is a Brownian motion under independent of B. Moreover the equation for the signal process 
X becomes 

(5) dX t = lo(X u Y t )dr + ]T Z k (X t ,Y t )dY t k + L 3 (X u Y t )dBl 

k 3 

Observe that equation (5) is now written in terms of the pair of Brownian motions (Y, B) and the coefficient 
Iq is given by Iq = Iq + J2k ^khk- Moreover, for any measurable, bounded function / : M. dx +dY — > R, we 
have the following formula called the Kallianpur-Striebel's formula, 

(6) n t (f) = ^, p t (f):=E [f(X t ,Y t )v t \y t ] 



2 For example, if Novikov's condition is satisfied, that is, if E j^exp^ J* \\h 1 (X a , Y a )\\ 2 ds^ < oo for all t > 0, then u is a 
martingale. In particular it will be satisfied in our setting, in which h is bounded. 
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where v = {v t ,t > 0} is the process defined as v t := cxp(/ t ), t > and 

(7) I t := ^ Qf h*(Jr rj y r )dl? - ^ jT (V'(JT r ,y P )) a dr^ , * > 0. 

The representation (6) suggests the following three-step methodology to construct a robust representation 



formula for n t 

Step 1: . We construct the triplet of processes (X v , Y v , I v ) 3 corresponding to the pair (y, £?) where 
y is now a fixed observation path y_ — {y s ,s £ [0,i]} belonging to a suitable class of continuous 
functions and prove that the random variable f(X v ,Y y )exp(I y ) is Po-integrable. 

Step 2: . We prove that the function y. — > g{(y.) defined as 

(8) ff /(y.)=E [/(^,y/)exp(^)] 
is continuous. 

Step 3: . We prove that g{ (Y.) is a version of Pt(f)- Then, following (6), the function, y. — > 0{(y ) 
defined as 



(9) 



nf _ 9t 

1 il 



provides the robust version of of 7r*(/)- 

We emphasize that Step 3 cannot be omitted from the methodology. Indeed one has to prove that g((Y.) 
is a version of Pt(f) as this fact is not immediate from the definition of g{ . 

Step 1 is immediate in the particular case when only the Brownian motion B drives X (i.e. the coefficient 
Z = 0) and X is itself a diffusion, i.e., it satisfies an equation of the form 

(10) dX t = h{X t )dr + Y,L 3 {X t )dBl 

j 

and h does only depend on X. In this case the process (X V ,Y V ) can be taken to be the pair (X,y). 
Moreover, we can define I v by the formula 



<=1 V Ja 



' y l r dh*(X r ) --j^ (h l (X r , Y r )Ydr) , t > 0. 

provided the processes h l (X) are semi-martingales. In (11), the integral J* y l r dh l (X r ) is the ltd integral 
of the non-random process y % with respect to h % {X). Note that the formula for I\ is obtained by applying 
integration by parts to the stochastic integral in (7) 

(12) f h l (X r )dY; = h\X t )Y l r - f Y r l dh l (X r ), 

Jo Jo 

and replacing the process Y by the fixed path y in (12). This approach has been successfully used to study 
the robustness property for the filtering problem for the above case in a number of papers ([6, 7, 18]). 

The construction of the process (X v ,Y V , I v ) is no longer immediate in the case when Z ^ 0, i.e. when 
the signal is driven by both B and W (the correlated noise case). In the case when the observation is one 
dimensional one can solve this problem by using a method akin with Doss-Sussmann's "pathwise solution" 
of a stochastic differential equation (see [15], [23]). This approach has been employed by Davis to extend 



As we shall see momentarily, in the uncorrelated case the choice of Y y will trivially be y. In the correlated case we 
make it part of the SDE with rough drift, for (notational) convenience. 
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the robustness result to the correlated noise case with scalar observation (see, [10, 11, 12, 14]). In this 
case one constructs first a diffeomorphism which is a pathwise solution of the equation 4 

(13) <j>{t,x) =x+ [ Z{4>{s,x))°dY t . 

Jo 

The diffeomorphism is used to express the solution X of equation (5) as a composition between the 
diffeomorphism and the solution of a stochastic differential equation driven by B only and whose 
coefficients depend continously on Y. As a result, we can make sense of X y . I v is then defined by a 
suitable (formal) integration by parts that produces a pathwise interpretation of the stochastic integral 
appearing in (7) and Y v is chosen to be y, as before. The robust representation formula is then introduced 
as per (9). 

To our knowledge, the correlated noise and multidimensional observation case has not been studied and it 
is the subject of the current work. In this case in turns out that we cannot hope to have robustness in the 
sense advocated by Clark. More precisely, there may not exists a map continuous map d{ : C([0, t],M. dY ) — > 
R, such that the representation (1) holds almost surely. The following is a simple example that illustrates 
this. 

Example 1. Consider the filtering problem where the signal and the observation process solve the following 
pair of equations 

X t =X + [ X r dY r + - f X r dr 



Jo 2 jo 

Y t = f h(X r )dr + W t , 
Jo 



where Y is 2-dimensional and P(Xo = 0) = P(Xo = 1) = h. Then with f,h such that /(0) = hi(0) 
7*2(0) = one can explicitly compute 

(14) E[f(x t )\y t] = ■ /(cxp(Ft)) - - t 



exp(- ^ / h k (exp(Y r ))dY r k + ^ f ||/i(cxp(Y r )|| 2 dr 
1 — 1 o jo Jo 



fe=l,2 

Following the findings of rough path theory (see, eg, [16, 19, 20, 21] ) the expression on the right hand 
side of (14) is not continuous in supremum norm (nor in any other metric on path space) because of 
the stochastic integral. Explicitly, this follows, for example, from Theorem 1.1.1 in [21] by rewriting the 
exponential term as the solution to a stochastic differential equation driven by Y . 

Nevertheless, we can show that a variation of the robustness representation formula still exists in this 
case. For this we need to "enhance" the original process Y by adding a second component to it which 
consists of its iterated integrals (that, knowing the path, is in a one-to-one correspondance with the Levy 
area process). Explicitly we consider the process Y = {Y t ,t > 0} defined as 

/ / JoX'odY^ ... £Y*odYf \\ 

(15) Y t = It, ••• , t >0. 

V \fiYfodY? ••• SlYfodYf )) 

The stochastic integrals in (15) are Stratonovich integrals. The state space of Y is G 2 (R dY ) = M. dY © 
so(dy), where so(dy) is the set of anti-symmetric matrices of dimension dy. 5 Over this state space we 
consider not the space of continuous function, but a subspace C 0,a that contains paths 77 : [0,t] — > G 2 (R. dY ) 
that are a- Holder in the M. dY -component and somewhat "2a- Holder" in the so(dy )-component, where a 



4 Here d Y = 1 and Y is scalar. 

^More generally, G^ 1 /™! (R^) is the "correct" state space for a geometric o-H61der rough path; the space of such paths 
subject to a-H61dcr regularity (in rough path sense) yields a complete metric space under o-H61der rough path metric. 
Technical details of geometric rough path spaces (as found e.g. in section 9 of [16]) are not required for understanding the 
results of the present paper. 
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is a suitably chosen constant a < 1/2. Note that there exists a modification of Y such that Y(uj) G C 0,a 
for all uj (Corollary 13.14 in [16]). 

The space C°' a is endowed with the a-Holder rough path metric under which C 0,a becomes a complete 
metric space. The main result of the paper (captured in Theorem 6 and Theorem 7) is that there 
exists a continuous map 0{ : C a — > R, such that 

(16) n t ((p) = 6 f t (Y.) P-a.s. 

Even though the map is defined on a slightly more abstract space, it nonetheless enjoys the desirable 
properties described above for the case of a continuous version on C([0,i],M d ). Since PoY" 1 positively 
charges all non-empty open sets of C°' a , 6 the continuous version we construct will be unique. Also, it 
provides a certain model robustness, in the sense that E{(<p(X t ) — 6*/(Y.)) 2 ] is well approximated by 
the idealized error K[(ip(X t ) — 6*/(Y.)) 2 ], if Y. is close in distribution to Y.. The problem of discrete 
observation is a little more delicate. It is true, that the rough path lift Y calculated from the linearly 
interpolated Brownian motion Y will converge to the true rough path Y in probability as the mesh goes 
to zero (Corollary 13.21 in [16]), which implies that 8{ (Y) is close in probability to 0{ (Y) (we provide 
local Lipschitz estimates for 6?). In effect, most sensible approximations will do, as is for example shown 
in Chapter 13 in [16] (although, contrary to the uncorrelated case, not all interpolations that converge in 
uniform topology will work, see e.g. Theorem 13.24 ibid). But these are probabilistic statements, that 
somehow miss the pathwise stability that one wants to provide with 6{ . If on the other hand, one is able 
to observe (at discrete time points), instead of only the process itself, also the second level, i.e. the area, 
one can construct an interpolating rough path by using geodesic interpolation (see e.g. Chapter 13.3.1 
in [16]), which is close to the true (lifted) observation path Y in the relevant metric for all realizations 
Y e C°' a . 

The following is the outline of the paper: In the next section, we enumerate the common notation used 
throughout the paper. In Section 3 we introduce the notion of a stochastic differential equation with rough 
drift, which is necessary for our main result and correspond to Step 1 above. We present it separately of the 
filtering problem, since we believe this notion to be of independent interest. The proof of the existence of 
a solution of a stochastic differential equation with rough drifts and its properties is postponed to Section 
5. Section 4 contains the main results of the paper and the assumptions under which they hold true. Step 
2 and 3 of above mentioned methodology are carried out in Theorems 6 and 7. 

Acknowledgements. The work of DC was partially supported by the EPSRC Grant No: EP/H0005500/1. 
JD is supported by DFG Grant SPP-1324. PKF and HO have received funding from the European Re- 
search Council under the European Union's Seventh Framework Programme (FP7/2007-2013) / ERC 
grant agreement nr. 258237. 

2. NOMENCLATURE 

Lip 7 is the set of 7-Lipschitz 7 functions a : M. m — > M. n where m and n are chosen according to the context. 

G 2 (M. dY ) = M. d © so(dy) is the state space for a dy-dimensional Brownian motion (or, in general for an 
arbitrary semi-martingale) and its corresponding Levy area. 

c o, a ._ c°' Q - H51 ([(U],G 2 (]fr^)) is the set of geometric a-H61der rough paths r\ : [0,t] -> G 2 (R dY ) starting 
at 0. We shall use the non- homogenous metric p a -H6i on this space. 

In the following we will make use of an auxiliary filtered probability space (f2, J 7 , (F , t )j>o,P) carrying a 
ds-dimensional Brownian motion B. 8 



This fact is a consequence of the support theorem of Brownian motion in Holder rough path topology [17], see also 
Chapter 13 in [16]. 

7 In the sense of E. Stein, i.e. bounded fc-th dervative for k = 0, . . . , [7] and 7 — [7] -Holder continuous [7j-th derivative. 

8 We introduce this auxiliary probability space, since in the proof of Theorem 7 it will be easier to work on a product 
space separating the randomness coming from Y and B. A similar approach was followed in the proof of Theorem 1 in [2] 



(i 



D. CRISAN, J. DIEHL, P. K. FRIZ AND H. OBERHAUSER 



Let S° = <S°(f2) denote the space of adapted, continuous processes in M. ds , with the topology of uniform 
convergence in probability. 

For q > 1 we denote by S q = S q (0.) the space of processes X G S° such that 



1/9 

\X\\s* ■= ( E[sup \X t \ q ] I <oo. 



s</ 



3. SDE WITH ROUGH DRIFT 



For the statement and proof of the main results we shall use the notion (and the properties) of an SDE 
with rough drift captured in the following theorems. The proofs are postponed to Section 5. 

As defined above, let (O, T, (F t )t>o, P) be a filtered probability space carrying a de-dimensional Brownian 
motion B and a bounded ds-dimcnsional random vector So independent of B. In the following, we fix 
e G (0, 1) and a G (jt^i \)- Let 77™ : [0,t] — > R dy be smooth paths, such that 77™ — > r] in a-H61der, for 
some r) G C°' a and let S n be a ds-dimensional process which is the unique solution to the classical SDE 

S? = S + f a{S?)dr + f b(S?)dB r + f c{S?)drff, 
Jo Jo Jo 

where we assume that 9 

(al): a G Lip x (M rfs ), h,...,b dB G Lip x (M rfs ) and ci,...,c dv G Lip 4+e (IR d s) 
(al')= a G Lip 1 ^), 6i,...,6«j a G Lip 1 (R ds ) and Cl ,...,c rfy G Lip 5+e (R ds ))- 

Theorem 2. Under assumption (al), t/iere exists a ds -dimensional process S°° G S° such that 

S n -> S°°, in S°. 

In addition, the limit H(tj) := S 00 only depends on rj and not on the approximating sequence. 

Moreover, for all q > 1, 77 G C°' Q it holds that 5(rj) G tS 9 and the corresponding mapping S : C°' Q — S> <S 9 
is locally uniformly continuous (and locally Lipschitz under assumption (al')j. 

Following Theorem 2, we say that S(tj) is a solution of the SDE with rough drift 

(17) E(rj) t =So+ [ a(E(rj) r )dr+ f b(E( V ) r )dB r + [ c(E( V ) r )dr] r . 

Jo Jo Jo 

The following result establishes some of the salient properties of solutions of SDEs with rough drift. Recall 
that (f2, T , Po) carries, as above, the dy-dimensional Brownian motion Y and let Ct = x f2 be the product 
space, with product measure P := Po <8> P. Let S be the unique solution on this probability space to the 
SDE 

(18) S t = S + I a(S r )dr + [ b{S r )dB r + [ c(S r ) o dY r . 

Jo Jo Jo 

Denote by Y the rough path lift of Y (i.e. the enhanced Brownian Motion over Y). 
Theorem 3. Under assumption (al) we have that 

• For every R > 0, q > 1 

(19) sup E[exp(g|S(j7)| oo;[0it] )] < 00. 

\\ri\\ a -H5l<R 

• For Po — a.e. lu 

(20) P[5.(o; J .) = S(y(w)) a (-), s<t] = l. 

9 In the forthcoming publication [8], wc show existence of a solution of an SDE with rough drift under more sets of 
assumptions and provide additional applications of SDEs with rough drift. 
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4. Assumptions and main results 

In the following we will make use of the Stratonovich version of equation (5), i.e. we will consider that 
the signal satisfies the equation 

X t =X + f L (X r ,Y r )dr + V / Z k (X r ,Y r )odY r k + I L 3 (X r ,Y r )dBl, 
Jo u Jo „■ ^0 



(21) 



Y t = f h{X r ,Y r )dr + W t . 
Jo 



where L 3 (x,y) = l J {x,y) ~ \Y,kY,i d x i Zi(x,y)Z l k (x,y) - \Y,k d Vk Z {{x,y). We remind that under P 
the observation Y is a Brownian motion independent of B. 

We will assume that / is a bounded Lipschitz function and we fix e £ (0, 1) a 6 (2+e' l)> ^ > an< ^ -^0 is 
a bounded random vector independent of B and Y. We will use one of the following assumptions 

(Al): Zi,..., Z dY G Lip 4+e , h 1 ,..., h dY G Lip 4+e and L , L 1} ..., L dB G Lip 1 
(Al'): Z u ...,Z dY G Lip 5+£ , h l ,...,h dY G Lip 5+e and L , L x , ■ ■ ■ , L ds G Lip 1 

Remark 4. Assumption (Al) and (Al') lead to the existence of a solution of an SDEs with rough driver 
(Theorem 2). Under (Al) the solution mapping is locally uniformly continuous, under (Al') it is locally 
Lipschitz (Theorem 3). 

Assume either (Al) or (Al'). For T] G C 0,q there exists by Theorem 2 a solution (A r, ,/ TJ ) to the following 
SDE with rough drift 

X? = X + f L (X?, Y?)dr + f Z(X?, Y?)d Vr + V f L^X? ,Y?)dB>, 
Jo Jo j Jo 

(22) Y?=[ t dr lr , 
Jo 

I?= [ h(X?,Y?)dr, r -±y2 I D k h k {X?,Y?)dr. 
Jo 1 k Jo 

Remark 5. Note that formally (!) when replacing the rough path T] with the process Y, X v ,Y V yields 
the solution to the SDE (21) anrfexp(/ t ?7 ) yields the (Girsanov) multiplicator in (6). This observation is 
made precise in the statement of Theorem 2. 



We introduce the functions g' , g 1 , 9 : C a —> K defined as 



^fa):=E[/pc^l?)exp(J?)], g'^^n^Mn 9( v ):=^\, veC a - a . 

9 iv) 

Theorem 6. Assume that (Al) holds, then 9 is locally uniformly continuous. Moreover if (Al') holds, 
then 9 is locally Lipschitz. 

Proof. From Theorem 2 we know that for r/ G C°- Q the SDE with rough drift (22) has a unique solution 
(A 7 ?,y,/ 7 ') belonging to S 2 . 

Let now rj, 77' G C°' a . Denote X = X^ , Y = Y V ,I = L v and analogously for 77'. 



8 



D. CRISAN, J. DIEHL, P. K. FRIZ AND H. OBERHAUSER 



Then 

\gf(r,)-gf(r,)\<E[\f(X t ,Y t )exp(I t )-f(Xl,Y» t )eMl' t )\} 

< E[\f(Xt,Y t )\\exp(It)-exp(Il)\]+E[\f(X t ,Y t ) - f{X' t ,Y()\ expffl] 

< l/UEO exp(/ t ) - cxp(/ t ')|] + E[\f(X t ,Y t ) f{X' t , Y()\ 2 ]^E[\ exp(/ t ')| 2 ] 1/2 

< i/ue[| exp(/ t ) + eMa 2 ] 1/2 n\i t - I't\] 1/2 

+nf(XuYt)-f{KX)\ 2 \ 1/2 mzMi't)\ 2 \ 1 ' 2 



Hence, using from Theorems 2+3 the continuity statements as well as the boundedncss of exponential 
moments, we see that gf is locally uniformly continuous under (Al) and it is locally Lipschitz under (Al'). 

The same then holds true for g 1 and moreover > 0. Hence 9 is locally uniformly continuous under 

(Al) and locally Lipschitz under (Al'). □ 

Denote by Y,, as before, the canonical rough path lift of Y to C 0,a . We then have 
Theorem 7. Assume either (Al) or (Al'). Then 9(Y .) =n t (f), P - a.s. 

Proof Of Theorem 7. To prove the statement it is enough to show that 

g i {Y.)=p t (f) P-a.s. 

which is equivalent to 

g f (Y.)=p t (f) Po-a.s. 

For that, it suffices to show that 

(23) E [ Pt (f)T(Y.)} = E [gf(Y.)r(Y)}, 

for an arbitrary continuous bounded function T : C([0,t],M. dY ) —> M. 

Let (fi, J 7 , P) be the auxiliary probability space from before, carrying an de-dimensional Brownian motion 
B. Let (0, J",P) := (Six!]/® J",P ® P). By Y and X we denote also the 'lift' of Y to fi, i.e. 
Y(u,ui) = Y(cj), Xo(lu,ui) = Xq(uj). Then (Y,B) (on ft under Pq) has the same distribution as (Y,B) 
(on Cl under P). 

Denote by (XJ) the solution on (ft,F,P) to the SDE 

X t =X + [ L (X r ,Y r )dr + ^ f Z k {X r ,Y r ) o dY r k + V / Lj{X r ,Y r )dBl, 
Jo k JO j Jo 

It = J2 f h k (X r , Y r ) o dY r k - \ V / D k h k (X r , Y r )dr. 



Then 



(F,X,I) f ~ (f,A,^ y fe*(X r ,i;)ody r fc -i^; y D k h k (X ri Y r )dr 



k " u fe 
Hence, for the left hand side of (23), 



E [p t (/)T(K)] =E [/(A t ,y t )cxp (^2j\ k (X r ,Y r )odY k D k h k (X ri Y r )dr^j T(Y)} 

= E[f(X t ,Y t )exp(i t )?(Y)] 
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On the other hand, from Theorem 3 we know that for Po — a.e u> 

X Y (ui \uj) t = X t (u),Q) for P- a.e. Q, 
Y Y {u) (u)) t = Y t {u,Cj) for P- a.e. ui, 
I Y ^\uj) t = I t (u,u>) for P- a.e. u>. 

Hence, for the right hand side of (23) we get (using Fubini for the last equality) 
E [gf(Y.)T(Y.)} = E [E[f(X Y - ,Y Y )exp (l Y )}r(Y)} 
= E [E[/(X t ,y t )exp (l t )]?(Y.)] 
= E[f(X t ,Y t )exp (lt)r(Y)}, 

which yields (23). □ 

5. Proof of Theorems 2 and 3 

Proof of Theorem 2. Let rj £ C 0,a be the lift of a smooth path rj. Let S v be the unique solution of the 
SDE 

S? = Sq + f a(S?)dr + f b{S?)dB r + f c(S?)dr) r . 
Jo Jo Jo 

Define S* 1 := (<f ) _1 (*= S?), where <^ is the ODE flow 
(24) <f>"(t,x)=x + [ c(<f>"(r,x))dn r . 

By Lemma 9, we have that S v satisfies the SDE 

r t 

?v _ 



(25) S? = So+ / ~a v {r,S?)dr,+ / W{r,S^)dB T 

Jo Jo 

with a* 7 , b v defined as in Lemma 9. 

This equation makes sense, even if 77 is a generic rough path in C°' a , (in which case (24) is now really an 
RDE). Indeed, since the first two derivatives of <$P and its inverse are bounded (Proposition 11.11 in [16]) 
we have that a v (t, •), b v (t, •) are also in Lip 1 . Hence by Theorem V.7 in [22], there exists a unique strong 
solution to (25). 

We define the mapping introduced in Theorem 2 as 

S(ii) t :=0"(t,S?). 

To show continuity of the mapping we restrict ourselves to the case q — 2. Moreover we shall assume 
ci, . . . , Cd Y & Lip 5+e (R ds ), and we will hence prove the local Lipschitz property of the respective maps. 

Let T] 1 ,^ 2 £ C°' a with |r; 1 | Q _H6i, |^ 2 |a-H6i < R- By Lemma 12 we have 



E[sup|^ -S 2 S \ 2 Y' 2 < C^uWPu-wziv 1 ,* 2 )- 
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Hence 

E[sup (-(r, 1 ), - Hfa 2 ) a | 2 ] 1/2 = E[sup \<j>\a, Si) - 2 (s, S 2 )] 2 } 1 / 2 

s<t 

< E[sup |^( S , Si) - ^(s, S 2 )\ 2 }^ 2 + E[sup |^( S) S 2 ) - <j?{s, ~S 2 S )\ 2 Y' 2 

s<t s<t 

<E[sup\cj ) 1 (s,Sl)-^( S ,S 2 )\ 2 } 1 / 2 + sup \^{a,x)-t^{a,x)\ 

s<t s<t,xGTS. d S 

< K(R)E[snp \Sl - ~S 2 S \ 2 ] 1 ' 2 + C LemU (R)p a . K oi(v\r, 2 ) 

s<t 

1 „2\ 



< Cip a -H6i(ri , q 



as desired, where 



Ci = K Leml3 (R)C Leml2 (R) + C Leml3 (R), 
where K Lem i 3 (R) and C Lem i 3 (R) are the constants from Lemma 13. □ 

Proof of Theorem 3. In order to show (19), pick feel,..., ds- We first note that by simply scaling (the 
coefficients of) S v it is sufficient to argue for q = 1. And consider the fc-th component of S. 

Then 

E[exp(\E^(v)\oo;lo,t])} < E[exp(|D^U (V(0, S )\ + |S (fe);77 U ; [o, t ]))] 



< expQDtl>"\ 00 sup |0^(O, . z))E[cxp(|^| 00 sup \S { t k) ' v \)} 

|as|<|SoUoo s<t 

< expflZty'loo sup |^(0, x)) 

\x\<\S a \ L oa 

E[exp(\Drp r >\ 00 sup S^)} + E[- exp(|£>V"|oo sup S^)} 

s<t s<t 

= exp(|Z?V' 7 U sup |^(0, x)) 

|x|<|5o|iOO 

x (E[supexp(|£>^' I | 0O 5i fc)i ' J )] + E[-supexp(|£>V ,, |ooS? ); ' 7 )] 

\ s<t s<t 

Now, only the boundedness of the last two terms remains to be shown, for r] bounded. 
By applying Ito's formula we get that 

exp(S t (fe);T? ) = 1 + f eMS^dS^ + f eMS^WS^r 
Jo Jo 

<Ib r t 



Jo i=1 Jo 

+ £ rexp(5( fc )^)|62i(^ fc )«)| a dr. 

Hence the process exp(S^' v ) satisfies a SDE with Lipschitz coefficients and by an application of Gronwalls 
Lemma and Burkholdcr-Davis-Gundy inequality (see also Lemma V.2 in [22]) one arrives at 



(26) sup supE[exp(|i^|ooSf );?7 )] < Ccxp(C 2 ) 

l»?U-Hol<-R «<* 

where C is universal and 



C 2 := sup |5"| 0O + nL 

V-\\v\\ a -Hal<R 
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which is finite because of Lemma 10. One argues analogously for sup s<t E[— exp(|Z?-0 r, | oo S' f '' c ' ) ' T ')], which 
then gives (19). 

Now, for the correspondence to an SDE solution let il be the additional probability space as given in the 
statement. Let S be the solution to the SDE (18). 

In Section 3 in [5] it was shown (see also Theorem 2 in [1]), that if we let be the stochastic (Stratonovich) 
flow 

0(w; t, x) = x + / c(0(o;;r, #)) o dY r (u>), 
Jo 

then with St := _1 (i, St) we have P-a.s. 

(27) S B (u,u S ) = S + [ a(r,S r )dr+ [ b(r,S r )dB r , s £ [0,t], Pa.e. (id,co S ). 

Jo Jo 

Here, componentwise, 

a(t, x)i := d Xk Qr\t, Q(t, x))a k (Q(t, x)) + 9x^7% *)) E 6 J'0(*> x )) b kl(®& x )l 



2 



b{t,x)ij :=Y,d Xk Q7 X {t^)b kj {Q{t,x)). 



k 

Especially, by a Fubini type theorem (e.g. Theorem 3.4.1 in [3]), there exists fio with Po(f2o) = 1 such 
that for u> € fio equation (27) holds true P a.s.. 

Let Y e C°' a be the enhanced Brownian motion over Y. We can then construct id-wise the rough flow 
0Y(«) as givcn in (24). By the very definition of E we know that 5 t Y(w) (w) := ((p Y{u ' } )~ 1 {id;t,E(id)t) 
satifies the SDE 

(28) S? M = S + / S Y(w) (r,S Y ^)dr + / fo Y ^(r, Sj { ^)dB r , Pa.e. lo b , 

Jo Jo 

where 

a Y ^> (t, x)i := £ 9 Xfe (4?M)7 l (t, ^ (t, x))a k (t, cf> Y ^ (t, x)) 

k 

+ \J2 d ^ (^Jr 1 (*> (*. *)) E 6 * (*. ^ Y(w) (*> (*. <^ YM (*. *))> 

6 Y <-) (t, x) tj :=E^ (^ YM (*. (*. ^ M (*, *))• 
fe 

It is a classical rough path result (see e.g. section 17.5 in [16]), that there exists f2i with P Y (Oi) = 1 such 
that for id S f2i, we have 

Y <">( v ) = e(o;; v )- 



Hence for w £ (!i we have that a = 5, Y ^\ 6 = 6 Y ("). Hence for for id e Qo H Qi the processes 
St (a;, •), 5 Y (•) satisfy the sam 
have for w G flo fl fli that P a.s. 



S t (w,-),5 Y(t,,) (0 satisfy the same Lipschitz SDE (with respect to P). 10 By strong uniqueness we hence 



&(av) = S Y(w) (0. «<t. 

Hence for w G Oo (~l f2i 

S S ( U ,)=S(Y( W ))(.) S , «<t, Fa.s. 

□ 



Here one has to argue, that fixing uj in equation (28) gives (Po-a.s.) the solution to the respective SDE on fl B . 
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Remark 8. We remark that the above idea of a flow decomposition is also used in the work by Davis 
[10, 11, 12, 14]. Without rough path theory this approach is restricted to one dimensional observation, 
since one for multidimensional flows one cannot hope for continuous dependence on the driving signal in 
supremum norm. 

Lemma 9. Let n be a smooth dy-dim path rj and S be the solution of the the following classical SDE 

St = Sq + / a(S r )dr + / b(S r )dB r + / c(S r )dr] r 
Jo Jo Jo 

where B is a ds -dimensional Brownian motion, c(S r )dr] r := 53i=i So c i(^r)Vr ( ^ r ' a Lip 1 (R ds ), 
b 1 ,...,b dB G Lip x (R ds ), cx,...,c dy G Lip 4+c (R ds ), and S G L°°(Q;M ds ) independent of B. Consider 
the flow 

(29) <f>(t, x) = x + [ c(</>(r,x))dr} r - 

Jo 

Then St := (j)^ 1 ^, St) satisfies the following SDE 

S t = So + / a(r,S r )dr + / b(r,S r )dB r , 
Jo Jo 

where we define componentwise 



a(t,x)i ■=^2d Xk <j) l 1 (t,cf>(t,x))a k ((t>(t,x)) + \^ J d XjXh <j> i 1 (t,<f>(t,x))^2b :jl ((j)(t,x))b k i((j)(t,x)), 



j.k 



b(t,x)ij — ^dx^ 1 (t,cf)(t,x))b kj {(f)(t,x)). 



Proof. Denote ip(t, x) := cf> 1 (t, x). Then 

ip{r,x)=x— I d x ip(r,x)c(x)dr] r . 
Jo 

By Ito's formula 

Mt, S t ) - ^(0, So) = / OtMr, S r )dr + £ / d X] ^{r, S r )dS 3 {r) \ / d XjXk ^(r, S r )d(S k , Sj), 



o J Jo ~ k 2 Jo 

y2 d Xj tpj (r,S r )aj(S r )dr + y] / d Xj ^j(r, S r ) ^ b jk (S r )dB k (r) 
j J ° j Jo k 

1 /"* 

+ / d XjXk ipi(Sr)y2b k i{Sr)bji(Sr)dr. 



□ 



Lemma 10. Consider for a rough path r\ G C°' Q the analogously to Lemma 9 transformed coefficients, 
a* 1 , b 11 , i.e. consider the rough flow 

(30) (j>(t,x) = 4> n {t,x) =x+ f c((j>(r,x))dr] r . 

Jo 

and define 

a"(t, x)i := MT 1 ^ <K*> z))*k(<t>(t, x)) + ^J2 d ^T\t, <t>(t, *)) J2 x )) b " M*> 



2 

j,k 



b"a..r), : :=^% h <f>i 1 (t,<t>(t,x))b kj (<t>(t,x)). 
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Then for every R > there exists K Lem iQ = K Lem io(R) < oo such that 

sup \a r >\ ao < K LemlQ , 

V-\v\a-H6l<R 

sup < K Lemi0 , 

v-\v\o,-Hoi<R 

sup sup (Da^s, Oloo < K Lem iQ, 

r):\v\a-H6l<R 8<t 

sup sup [Db^is, - Jloo < K LemlQ , 

and smc/j i/*a£ if r\,r\ are two rough paths with \rf\ a —H5i, \v\a-H6i < R we have 

sup^^x) -h 2 {t,x)\ < K Leml0 (R)p a - H di(v\r] 2 ), 

t.X 

sup fb 1 (t, x) -b 2 (t,x)\ < K LemlQ {R) Pa ^H6i{ri 1 1 V 2 )- 

t,x 

Proof. This is a straightforward calculation using Lemma 13 and the properties of a, b. □ 

The following is a standard result for continuous dependence of SDEs on parameters. 
Lemma 11. Let a l {t,x), b l (t,x),i = 1,2 be bounded and uniformly Lipschitz in x. 
Let S l be the corresponding unique solutions to the SDEs 

S l t =S + [ a i (r,§i.)dr+ f b\r,S l r )dB r , i = 1,2. 
Jo Jo 

Assume 

sup \Dd 1 (s, ■) |oo j sup l-D&^s, -)|oo < K < oo, 

s<t s<t 

sup (a 1 (r, x) — a 2 (r, x)) , sup I b (r, x) — b (r, x) J < e < oo. 

r,x r,x ^ > 

Then there exists C L -t-i = C Lem i\{K) such that 

E[sup|^ 1 -5 s 2 | 2 ] 1 /2< Ciemll £j 

s<t 

Proof. This is a straightforward application of Ito's formula and Burkholdcr-Davis-Gundy inequality. □ 
We now apply the previous Lemma to our concrete setting. 

Lemma 12. Let r] 1 ,^ 2 € C°' a and let S 1 ^ 2 be the corresponding unigue solutions to the SDEs 

§i = S + [ a l (r 1 S l r )dr+ [ b l (r, S l r )dB r , i = 1,2, 
Jo Jo 



where a 1 , b l are given as in Lemma 1 0. 

Assume R > max^T? 1 \ a -H6U ^U-Hoi} ■ Then there exists C ieml2 = C Leml2(R)- 

E[ S up\Sl - S 2 S \ 2 ] 1/2 < C Leml2 P a -Hoi(v\v 2 )- 



s<t 



Proof. Fix R > 0. Let HVIU-hsi, ||»7 2 |U-H6i < R- 
From Lemma 10 we know that 

supl&^x) -b 2 (t,x)\ < K Leml0 (R) Pa - Hdl {r)\r] 2 ) 
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Analogously, we get 

supla 1 ^,^) -a 2 (i, x)\ < L 2 p Q _ H 6i(»7 1 , rf ), 

t,X 

for al 2 = L 2 (R). □ 

Lemma 13. Let a € (0, 1). Let 7 > — > 1, k £ {1, 2, . . . } and assume that V = (Vi, . . . , Vd) is a 
collection of Lvp y+k -vector fields on K e . Write n — (m, . . . , n e ) S N e and assume \n\ := m + • • • + n e < fc. 
Then, for all R > f/iere exist C = C(i?, |F| LipT +fc), A' = K(R,\V\ Up -,+k)) such that i/x^x 2 6 
C Q -™([0,t],GM(M d )) with max, | |x 8 | | a _ H 67;[o,t] < # *en 

sup |9 n 7r (l / ) (0,y ;x 1 ) - 9„7r (y) (0,yo;x 2 )| Q _ ffc . ; . [0 t] < Cpa-^x 1 , x 2 ), 

a „ /n „, .„2\-l| ^ n„ /„1 „2n 



sup |a n 7r (y) (0,y ;x ) - <9„7r (y) (0,yo;x ) | a _ ff5fc[0)t] <Cp a - H 6i{* ,x ), 
sup I^tt^O^x 1 )!^ 
sup |a il 7r (v - ) (0,y ;x 1 )" 1 | Q _ ire , [o t] < 



u 

j/06 



Proof. The fact that € Lip 7+fc (instead of just Lip 7+fc_1 ) entails that the derivatives up to order k 
are unique non-explosive solutions to RDEs with Lip] oc vector fields (see Section 11 in [16]). Localisation 
(uniform for driving paths bounded in a-H61der norm) then yields the desired results. □ 
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